THE LARGE DEVIATION PRINCIPLE FOR THE 
ERDOS-RENYI RANDOM GRAPH 
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Abstract. What does an Erdos-Renyi graph look like when a rare 
event happens? This paper answers this question when p is fixed and 
n tends to infinity by establishing a large deviation principle under an 
appropriate topology. The formulation and proof of the main result 
uses the recent development of the theory of graph limits by Lovasz and 
coauthors and Szemeredi's regularity lemma from graph theory. As a 
basic application of the general principle, we work out large deviations 
for the number of triangles in G(n,p). Surprisingly, even this simple 
example yields an interesting double phase transition. 



1. Introduction 

1.1. The Erdos-Renyi graph. Let G(n,p) be the random graph on n 
vertices where each edge is added independently with probability p. This 
model has been the subject of extensive investigations since the pioneering 
work of Erdos and Renyi |14j , yielding a large body of literature (see [2j [18] 
for partial surveys). 

This paper studies the following basic aspect of Erdos-Renyi graphs: 
What does the graph look like if one knows that some rare event has hap- 
pened? One way to comprehensively answer this question is to formulate 
a large deviation principle for the Erdos-Renyi graph, in the same way as 
Sanov's theorem [27] gives a large deviation principle for an i.i.d. sample. 

The setting of Sanov's theorem conforms naturally to the abstract theory 
of large deviations (see Chapter 6 in [11]) because i.i.d. samples can be 
thought of as random probability measures, allowing them to be viewed 
as random elements of a single topological space irrespective of the sample 
size. The first hurdle in formulating such a program for random graphs is in 
constructing a single abstract space in which all graphs can be embedded. 
Fortunately, this issue has been settled recently. In a sequence of papers 
0[6l[7J[I3[IS[2Ol[2ll[22l[23l[Ml[25] Laszlo Lovasz with coauthors (listed 
here in order of frequency) V. T. Sos, B. Szegedy, C. Borgs, J. Chayes, 
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K. Vesztergombi, A. Schrijver and M. Freedman have developed a beautiful, 
unifying limit theory. (See also the related work of Diaconis and Janson [12] 
which traces this back to work of Aldous pQ and Hoover [17].) This sheds 
light on topics such as graph homomorphisms, Szemeredi's regularity lemma, 
quasi-random graphs, graph testing and extremal graph theory, and has even 
found applications in statistics and related areas (see e.g. [10] ). Their theory 
has been developed for dense graphs (number of edges comparable with the 
square of number of vertices) but parallel theories for sparse graphs are 
beginning to emerge [3]. 

1.2. Graph limits and graphons. The limit of a sequence of dense graphs 
can be defined as follows. We quote the definition verbatim from [22] (see 
also PEl Q2]). Let G n be a sequence of simple graphs whose number of 
nodes tends to infinity. For every fixed simple graph H, let |hom(iJ, G)\ 
denote the number of homomorphisms of H into G (i.e. edge-preserving 
maps V(H) — > V(G), where V(H) and V(G) are the vertex sets). This 
number is normalized to get the homomorphism density 

m hh r\ — \ hom (H,G)\ 

(1) t{H ' G) - \v{G)\\ym- 

This gives the probability that a random mapping V(H) — > V(G) is a ho- 
momorphism. 

Suppose that the graphs G n become more and more similar in the sense 
that t(H, G n ) tends to a limit t(H) for every H. One way to define a limit 
of the sequence {G n } is to define an appropriate limit object from which the 
values t(H) can be read off. 

The main result of [22] (following the earlier equivalent work of Aldous [1] 
and Hoover [T7]) is that indeed there is a natural "limit object" in the form 
of a function / £ W, where W is the space of all measurable functions from 
[0, l] 2 into [0, 1] that satisfy f(x,y) = f(y,x) for all x,y. 

Conversely, every such function arises as the limit of an appropriate graph 
sequence. This limit object determines all the limits of subgraph densities: 
if H is a simple graph with V(H) = [k] = {1, . . . , k}, then 



(2) t(H,f)= I \{ f(xi, 



^ k (i,j)£E(H) 



Xj) dxi ■ ■ ■ dxk- 



Here E(H) denotes the edge set of H. A sequence of graphs {G n } n >i is said 
to converge to / if for every finite simple graph H, 

(3) lim t(H,G n ) = t(H,f). 

n— >-oo 

Intuitively, the interval [0, 1] represents a 'continuum' of vertices, and f(x, y) 
denotes the probability of putting an edge between x and y. For example, 
for the Erdos-Flenyi graph G(n,p), if p is fixed and n — > oo, then the limit 
graph is represented by the function that is identically equal to p on [0, l] 2 . 
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These limit objects, i.e. elements of W, are called 'graphons' in [22j \6\ IT]. 
A finite simple graph G on {1, . . . , n} can also be represented as a graphon 
f G is a natural way, by defining 

G Jl if ( \nx\ , [ny] ) is an edge in G, 

( 4 ) f {x,y) = < 

I otherwise. 

Note that this allows all simple graphs, irrespective of the number of vertices, 
to be represented as elements of a single abstract space, namely W. 

1.3. The cut metric. With the above representation, it turns out that the 
notion of convergence in terms of subgraph densities outlined above can be 
captured by an explicit metric on W, the so-called 'cut distance' (originally 
defined for finite graphs by Frieze and Kannan [16J). We start with the space 
W of measurable functions f{x, y) on [0, l] 2 that satisfy < f(x, y) < 1 and 
f(x,y) = f(y,x). We define the cut distance 



(5) d n (f,g) := sup 

5,TC[0,1] 



[f(x,y) - g(x,y)]dxdy 

SxT 



We introduce in W an equivalence relation. Let E be the space of measure 
preserving bijections a : [0, 1] — > [0, 1]. Say that f(x, y) ~ g(x, y) if f(x, y) = 
g CT (x,y) := g(ax,ay) for some a € E. Denote by g the closure in (W, da) 
of the orbit {g a }. The quotient space is denoted by W and r denotes the 
natural map g —>g. Since da is invariant under a one can define on W, the 
natural distance Sa by 

<fa(/,flO := inf d n (/,0 CT ) = inf d n (/ CT ,#) = inf d a (f ai , g a2 ) 

cr a 0"i,(T2 

making (VV, 5a) into a metric space. To any finite graph G, we associate f G 
as in ([4]) and its orbit G = rf = f G € W. One of the key results of [6] is 
the following: 

Theorem 1.1 (Theorem 3.8 in [6]). A sequence of graphs {G n } n >i con- 
verges to a limit f € W in the sense defined in ([3]) if and only if 5a{G n , f) — > 
as n — > oo. 

Szemeredi's regularity lemma and the related deep results of Lovasz and 
Szegedy will play a crucial role in this paper: 

1.4. Szemeredi's lemma. Let G = (V, E) be a simple graph, and let X, Y 
be subsets of V. Then we denote by ec(X, Y) the number of X-Y edges of 
G (edges whose endpoints belong tolfiy are counted twice), and call 

e G (X,Y) 



Pg(X,Y) :-- 



\X\\Y\ 



the density of the pair (X, Y). Given some e > 0, we call a pair (A, B) of 
disjoint sets A,BCV e-regular if all X C A and Y C B with \X\ > e\A\ 
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and \Y\ > e\B\ satisfy 



p G (X,Y)-p G (A,B)\<e. 



A partition {Vq,...,Vk} of V is called an e-regular partition of G if it 
satisfies the following two conditions: 



(iii) all but at most eK 2 of the pairs (Vi,Vj) with 1 < i < j < K are 
e-regular. 

Szemeredi's regularity lemma goes as follows. 

Theorem 1.2 (Szemeredi's lemma [SB])- Given e > and an integer m > 1 
i/iere exists an integer M = M(e, m) suca £/iat every graph of order at 
least M admits an e-regular partition {Vo, . . . , Vk} for some K in the range 
m < K < M . 

This result was proved by Szemeredi |28j in 1976 and has since found nu- 
merous applications in combinatorics, number theory and many other areas 
of discrete mathematics. The version presented above is from Diestel |13j . 
Section 7.2. Lovasz and Szegedy proved the following related result. 

Theorem 1.3 (Theorem 5.1 in [23]). The metric space (W,<5n) is compact. 




2.1. The rate function. The main goal of this paper is to prove a large 
deviation principle for G(n,p) when p is fixed and n — > oo. The discussion 
in Section [T] gives a topological space (namely, W) that is suitable for this 
purpose. The next step is to define a rate function on this space. Let 
Ip : [0, 1] ->• R be the function 



(i) | Vol < en; 

(ii) = \V 2 



\Vk\; 




2. The main result 



(6) 



I p (u) : = -nlog- + -(1 



u) log 

1-p 



= - sup au + 6(1 

2 a ,6eM L 



u) - log (pe a + (1 - p)e b ) 
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The domain of the function I p can be extended to W as 



I p (h) 
(7) 
(8) 



1 r l 



JO 

1 



JO 



I p (h(x,y)) dx dy 



h(x,y)log h (1 

V 



h(x,y)) lot 



1 - h(x,y) 



l-p 



dxdy 



1 

2 "(v) 



a(x, y)h(x, y) dx dy 

\og(pe a{x ' y) + (l-p)) dxdy 



The following property of I p is crucial. 

Lemma 2.1. The function I p is well defined on W and is lower semicon- 
tinuous under the cut metric Su onW. 

Proof. The supremum in (|8|) can be taken over all bounded measurable 
functions a on [0, l] 2 . As the supremum of a family of affine linear functionals 
continuous in the weak topology, I p (h) is lower semi-continuous in the weak 
topology and therefore also in the topology of the metric da ■ If o : [0, 1] — > 
[0, 1] is a measure preserving bijection then I p (h a ) = 1(h). By lower semi- 
continuity of g & h, I p (g) < I p (h). But g G h implies h G g so that 
Ip(h) < Ip(g)- Hence, I p (g) = I p (h) and I p ( ) is well defined and lower 
semi-continuous on W. □ 



2.2. The Large Deviation Principle. The random graph G(n,p) induces 
probability distributions P njP on the space W through the map G — > f G and 
P n>p on W through the map G — > f G — > / G = r/ G . The space W is compact 
in the weak topology and a large deviation principle for P njP on W in the 
weak topology with the lower-semicontinuous rate function I p (h) given by 
([7]) is elementary but is not of much use since quantities like 'triangle counts' 
are not stable in the weak topology. We will state it for the record and find 
a use for it later. 

Theorem 2.2. The sequence Pn,p on W satisfies a large deviation principle 
in the weak topology. That is, for every weakly closed set F C W 

limsup^logP„, p (F) < - inf I p (f) 

n— >oo n feF 

and for any open set U (again in the weak topology) in W 




Proof. The weak topology is defined through an arbitrary but finite number 
of linear functionals. Therefore the large deviation principle can be reduced 
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to the large deviation behavior of a finite set of linear functionals {Z^(f)} 
given by 

Zf(f) = {</>,/) := 
under the measure P n,p . The limit 



f(x,y)<p(x,y)dxdy 
<^P( ir II f(x,y)^)(x,y)dxdy 



lim log E^ n,p 

n— >oo n 

is easily calculated to yield 

~jj log{pe 2 ^ x ^ + (l-p))dxdy. 

(Note that this is true only if (j) is symmetric. However, since / is symmetric, 
it suffices to restrict attention to symmetric <fi.) Therefore, an abstract 
Gartner-Ellis Theorem (see e.g. Theorem 4.5.3 in [11]) gives the upper bound 
with rate function 

1 



Ipif) = sup 



Note that this is the rate function I p defined in (|7|) and 
is attained at the function 



logipe 2 *^ + {l-p))dxdy . 

. The supremum 



<Pf{x,y) :-- 
Note that for any g ^ f, 



1, f(x,y) 

-log 

2 p 



1 l-f(x,y) 
-log . 

2 1 — p 



[(<f>f,f)-W)) -({</>/, 9) -I P (JJ)) 



g(x,y) l—g(x,y) 
9{x, y) log — r + (1 -g[x,y))\o% 



dx dy > 0. 



f(x,y) ' l-f(x,y)_ 

This shows that every / is an exposed point of the lower semicontinuous rate 
function I p , in the parlance of convex analysis. Therefore by the Gartner- 
Ellis Theorem (see e.g. Theorem 4.5.20 in [11]) and the compactness of the 
weak topology, we get the lower bound. □ 

The large deviation principle for P n)P on (W, 5\j) is much more useful and 
is the main result of this article. 

Theorem 2.3. For each fixed p £ (0, 1), the sequence ^n,p obeys a lavge 
deviation principle in the space W (equipped with the cut metric) with rate 
function I p defined by (|7|). Explicitly, this means that for any closed set 



0) 



1 



lim sup — T logP„ )P (F) < - jnf I p (h). 



n 



and for any open set U C W, 



(10) 



1 



liminf-2 logP„ iP (i7) > - jnf I p (h). 



n— >oo n 
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For the upper bound, because (W, Sn) is compact, it is sufficient to prove 
that for any h € W, 

1 „ 

limlimsup^jlogP niP (Sh(/i,77)) < -I p (h). 

For the lower bound we need to prove that if h G W and 77 > is arbitrary 

liminf — logP n ,p(S n (M)) > 
where S n (h,r)) = {g : 5 a (g, h) < n}. 

2.3. Proof of the upper bound in Theorem 12.31 Let B(h,rj) C W be 
defined as 

B(h,rj) = T - 1 S n (h,7 1 ) CW 
i.e. the union of all the orbits from So(h,rj) C W. We need to show that 

(11) limlimsup^logP n , p [£(M)] < -I p (h) 

Let the set of n vertices be partitioned into K subsets of size a with a 
remainder of size 6, so that n = Ka + b. We assume that b < en. We order 
the vertices so that Vq = {1,2,... b} and V{ = {b + (i — l)a + 1, . . . , b + 
ia} for i = 1,2,..., if. We map the vertices into subintervals of the unit 
interval, with the vertex r represented by the interval i 1 -^-, -]• The sets Vi 
of vertices will then correspond to the intervals Eq = [0, — ] for i = and 

Et = [ b+(i ~ 1)a , ^] for 1 < i < K. Let us denote by V K the subset of W 
consisting of g(x, y) that are of the form 

K 

a(x,y) = PiAEiix^E^y) 

where {pij}, 1 < i,j < K is symmetric and satisfies < pj,- < 1. For any 
pair m, M with m < M, we define V m ,M = ^m<K<M^K- The following is a 
restatement of the Szemeredi regularity lemma. 

Lemma 2.4. Given any e > and m > 1 suc/i £/ia£ 2m _1 < e, t/iere is 
M = M(e,m) sitc/i i/iai for any graph G, there exists a permutation it, i.e. 
a relabeling of the vertices of the graph, such that 

inf d n (r G ,g)<e. 

Proof. Let e' and m be given. According to Szemeredi's lemma, there is 
M(e', to) such that, for some K in the range m < K < M, we can find a 
partition Vo, • • • Vk which is e'-regular. After a permutation we can assume 
that the ordering of the vertices coincides with the ordering of the partitions. 
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We define p iti = p ,i = Pi,o = and for 1 < i ^ j, p itj = p G (Vi,Vj). This 
leads to 

K 

g( x ,y) = Y PG{Vi,Vj)i-E t {x)iE 3 {y) 

i+3 

to be compared with f G when Vq, . . . ,Vk is an e' regular partition of G. 
Recall that 



dn(f G ,g)= sup 

S,Tc[0,i; 



[f G (x,y) - g(x,y)]dxdy 



SxT 



Since both f G and g are constant on sets of the form [^, ^] x it is 

easy to see that S and T can be restricted to unions of intervals of the form 
[n' T 1 ] i- e - subsets of [0, 1] that represent subsets of vertices. These subsets 
will also be denoted by S and T. Now, given two such subsets S and T, 



(f G (x,y) - g(x,y))dxdy 

SxT 

= Y\ // (f G (x,y) - g(x,y))dxdy. 

o<i,j<K JJ ( SnE ^ x ( TnE ^ 

Let Aij denote the (i, j)th term in the above sum. Since f G and g both take 
values in [0, 1], therefore for each i and each j, 

\Aio\ < \Ei\\Eo\, \Aqj\ < \Eo\\Ej\. 

For the same reason, if \S n Ei\ < e'\Ei\ or \T C\ Ej\ < e'\Ej\, then 

\Aij\ < e'lEiWEj]. 

If 1 < i ^ j } \SC\Ei\ > e'\Ei\, \TC\Ej\ > e'\Ej\ and the pair (Vi,Vj) is 
e'-regular, then 

4 e G (S n v;, t n Vj) - PG (Vi, Vj)\s n E t \ \t n ^ | 



rr 



< | PG (5 n 5 n v$) - PG (Vi, Vj)\ \s n E t \ \t n ^ | 



< e'l^H^I. 



Finally, if either 1 <i = j or (Vi,Vj) is not e'-regular (of which there are at 
most K + 2e'K 2 cases), then we have the trivial bound \Aij\ < ^ < A 
combination of the above estimates gives 

\Aij\ <2\E \+e' + (K + 2e'K 2 )^ 



0<i,j<K 



K 2 



< (2e' + 2e' + e' + K' 1 ) 



Thus, 

dn(f G , g) < 5e' + K' 1 < he' + rxT 1 
Since m _1 < e/2, we can choose e' so that 5e' + mT x < e. 



□ 
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Lemma 2.5. Let e,m and M be as in Lemma \2A\ 

F n , p (B(h, V)) < n\ F ntP (B{h, n) n B(V m , M , e)) 
where B(V m ,Mi e ) = id ■ ini /eV m , M d u{g,f) < e}. 

Proof. The orbit under the permutation group has at most n\ elements and 
they all have the same probability under P n p . Moreover by the Lemma 12.41 
every orbit meets B(y m ,M,e), and B(h,rj) is invariant under a € £ and 
therefore under n. Consequently 

B(M) c|j7r- 1 (B(fc,f;)n5(V m ,jif ) €)) 

7T 

and the lemma follows. □ 

Lemma 2.6. There exists a function 8(h,e), depending only on h and e, 
with 5(h,e) — > as e — > 0, swc/i i/iat /or each arbitrary but fixed e,m,M 
satisfying Lemma [2.41 

limlimsup-slogP nJ ,(B(^,7/)nS(Vm,Af s e)) ^ -ipfi + ^e) 

Proof. Since V TOj m is a finite union U m <AXM Vif it is sufficient to prove that 
for each K 

lim limsup -^logF n , p (B(h,ri) n B(V K ,e)) < -lJh) + 5(h,e) 

and 6(h,e) — > as e — > 0. For fixed K, Vx consists of a compact set of 
functions in Li([0, l] 2 ) and can be covered by a finite number of spheres of 
radius e in L\ and therefore in W. It is therefore sufficient to show that for 
fixed K and g £ Vk 

lim lim sup -= log F n JB(h, rj) D 2e)) < -LJh) + e) 

We can assume that -B(/i, rj)C]B(g, 2e) 7^ 0. Therefore 5 G -B(/i, r/+2e). Since 
7/ — >• we can assume rj < e so that g £ B(h, 3e). By lower semi-continuity 
of I p (-), I p (f) > I p (h) - 5(h,e) on B(g,2e) C B(h,5e) and <5(7i,e) ^ as 
e — > 0. We note that B(g,2e) C W is weakly closed and therefore by the 
upper bound in Theorem I2.2| 

lim lim sup \ \ogF n . p {B(h, 77) n B(g, 2e)) < lim sup -5 logP„. p (5(fl, 2e)) 

<- inf I p (f)<- inf I p (f)<-I p (h) + S(h,e) 

feB(g,2e) feB(h,5e) 

where <5(/i, e) ->■ as e -» 0. □ 

Lemma 12.51 and Lemma 12.61 yield (|lip . which proves the upper bound in 
Theorem 12.31 
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2.4. Proof of the lower bound in Theorem 12.31 Let h(x, y) € W be 
given. We define 



Pii = n 2 / h(x, y)dxdy 

J J\i=l ±] x [l=l i] 

L n 1 n J L n ' n 1 

and the corresponding function h n (x,y) G W by 

» 



h3 



Since II h r , 



lii([o,l] 2 ) 



0, it follows that da(h n ,h) — > 0. It is therefore 



sufficient to prove that for any e > 

liminf ^logP((in(/ G(n ' p Un) < e) > -I p (h). 



We define an inhomogeneous random graph where the edge connecting the 
vertices i and j is present with probability pfj ■ Different edges are inde- 
pendent. If £j = 1 when the edge connecting i, j is present and otherwise 



then are independent Binomial random variables with P(f 
We denote by P n ^ the measure on W induced by 

f n (x,y) := y2&,jht± ±] (x)h 



1) 



(n) 



2.1(2/) 

n 1 rt ^ 



If A and -B are subsets of {1, . . . , n}, it is straightforward to calculate 



V>n(A) :=^logE p " 



ti- 



ll- 



E 1 °g EP "'' 1 exp(2Ate,,-pg ) )) 



+ 



log E p "' h 



ieAns,jr'G-BnA c 

or ieAnB c ,j£AnB 

or ieAnB c , jeA c ns 



exp(A(e M -^ ) )) 



Each term in the sum is easily estimated by 4-, providing an estimate of 
the form 



n,h 



(fn~ K)dxdy 

AxB 



> e < e 



where A = Uj G yi[^-, ^] and B is defined similarly. Since the number of sets 
like A x B is only 2 2n it follows that 

Wn,h(Mfn,K) >e) ->0 
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as n —7- oo. Now the lower bound is easily established by a simple tilting 
argument. Denoting by B e ^ n the set {/ : dn(f,h n ) < e} 

r r _, dp n,h 



di- 



ll, h 



By Jensen's inequality 

logP n , p (B e , n ) > logP n , h (S £ , n ) - - / log -^dP nth . 

Since f n ,h(Be,n) — ^ 1, it is easy to see that 

liminf^logP niP (B e n ) > - lim 4r / log 

n— Yoo n n— >oo fi J 



""l n,p 



n,h 



The entropy cost of tilting (i.e. the integral in the preceding display) is 

1 / J n) 1 - r> (n) \ 

-2 lAni lo e -f - + (1 " log ~T^~ ) 

i>j v 7 

which converges to as n — > oo. This proves the lower bound. 



3. Conditional distributions 

Theorem 12 . 31 gives estimates of the probabilities of rare events for G(n,p). 
However, it does not quite answer the following question: given that some 
particular rare event has occurred, what does the graph look like? Naturally, 
one might expect that if G(n,p) G F for some closed set F C W satisfying 

(12) inf I p (h) = mf_I p (h) > 0, 

h£F° heF 

then G(n,p) should resemble one of the minimizers of I p in F. (Here F° 
denotes the interior of F, as usual.) In other words, given that G(n,p) € F, 
one might expect that S\j(G(n,p), F*) ~ 0, where F* is the set of minimizers 
of I p in F and 

6 D (G(n,p),F*):= inf S D (G(n,p), h). 

heF* 

However, it is not obvious that a minimizer must exist in F. Here is where 
the compactness of W comes to the rescue yet one more time: since the func- 
tion Ip is lower semicontinuous on F and F is closed, therefore a minimizer 
must necessarily exist. The following theorem formalizes this argument. 
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Theorem 3.1. Take any p G (0,1). Let F be a closed subset of W satis- 
fying (fl2l) . Let F* be the subset of F where I p is minimized. Then F* is 
non-empty and compact, and for each n, and each e > 0, 

F{5 n (G(n,p),F*) > e | G{n,p) e F) < e ~ c ^ )n2 

where C(e,F) is a positive constant depending only on e and F. In particu- 
lar, if F* contains only one element h* , then the conditional distribution of 
G(n,p) given G(n,p) G F converges to the point mass at h* as n —> oo. 

Proof. Since W is compact and F is a closed subset, therefore F is also 
compact. Since I p is a lower semicontinuous function on F (Lemma 12. 1|) and 
F is compact, it must attain its minimum on F. Thus, F* is non-empty. By 
the lower semicontinuity of I p , F* is closed (and hence compact). Fix e > 
and let 

F € := {h G F : 6 D (h,F*) > e}. 
Then F e is again a closed subset. Observe that 

F(S n (G(n,p),n > e \ G(n,p) e F) = ^"'^-^ 

P(G(n,p) G F) 

Thus, with 

h := mf I p (h), h ■= Jnf I p (h), 

heF h£F e 

Theorem 12.31 and condition (|12p give 

limsup^ logP(fa(G(n,p),F*) > e | G{n,p) G F) < h - J 2 . 
n— >oo fl 

The proof will be complete if it is shown that I\ < J2. 

Now clearly, I\ <Ii- If I\ = I2 > the compactness of F e implies that there 
exists h G F e satisfying I p (h) = Ii- However, this means that h G F* and 
hence F e n F* 7^ 0, which is impossible. □ 

4. Application to triangle counts 

4.1. Brief history of the problem. Let T ntP be the number of triangles 
in G(n,p). The primary objective of this section is to compute the large 
deviation rate function for the upper tail of T n p when p remains fixed and 
n — > 00. In other words, given p G [0, 1] and e > 0, we wish to evaluate the 
limit 

(13) lim \ logP(T njP > (1 + e)E(T„ iP )) 

n— >oo n 

as a function of p and e. 

The problem of estimating tail probabilities like ¥{T n<p > (1 + e)E(T nj5 )) 
has been studied extensively in the random graphs literature, particularly 
in the case when p is allowed to tend to zero as n — > 00. Computing upper 
and lower bounds on such tail probabilities that are sharp up to constants 
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in the exponent was a prominent open problem in this area until until it 
was solved recently in [8]. Let us refer to the paper [8] for a survey of the 
aforementioned literature. 

When p is fixed, computing sharp upper and lower bounds is relatively 
easy. The difficult problem in this case is the exact evaluation of the 
limit (113p . The first progress in this direction was made in [9] where it 
was shown that, given p G (0, 1), there exist p 3 /6 < t' < t" < 1/6 such that 
for alii G (p 3 /6,t')U(t", 1/6), 

(14) lim \ logP(T n)P > tn 3 ) = -/ p ((6i) 1/3 ), 

when is the entropy function defined in ([6]). Explicit formulas for p' and 
p" are also given in [SJ. Unfortunately, the result does not cover all values 
of (p, t); and neither is the above formula true for all (p,t), as we shall see 
below. 

There is a related unpublished manuscript by Bolthausen, Comets and 
Dembo [3j on large deviations for subgraph counts. As of now, to the best 
of our knowledge, the authors of [4j have only looked at subgraphs that do 
not complete loops, like 2-stars. Besides [9 J and [4j, we know of no other 
papers that attack the exact evaluation of (|13p (or equivalently, (|14p ). 

4.2. Exact large deviations for the upper tail. In this subsection, the 
limit (]14p is evaluated for all values of p and t. It comes as the solution 
of the following variational problem. Let W, W and <5n be defined as in 
Section [TJ For each / G W, let 

T (f) ■= ^ J J J f(x,y)f(y,z)f(z,x) dx dy dz 

and let I p (f ) be defined as in (J7J). Note that T can be defined on W simply 
by letting T(f) := T(f), because T is a continuous map on W under the dn 
pseudometric (Theorem 3.7 in [B]). 
For each p G [0, 1] and t G [0, 1/6), let 

(15) 4>(P, t) := inf{/ p (/) : / G W, T(f) > t}. 

For t > 1/6, let (f)(p,t) = oo. The following result gives the large deviation 
rate function for the upper tail of T n ^ p . (Note that this is just an illustrative 
example. Theorem 12.31 can be used to derive large deviations for any sub- 
graph count, or even joint large deviations for the counts of more than one 
subgraph.) 

Theorem 4.1. Let G(n,p) be the Erdds-Renyi random graph on n vertices 
with edge probability p. Let T n . p denote the number of triangles in G(n,p). 
Let 4> be defined as above. Then for each p G (0, 1) and each t > 0, 

lim -^\o S F(T njP > tn 3 ) = -</>(p,t). 

n— >oo n 
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Next, take any p G (0, 1) and t G (p 3 /6, 1/6). Let F* t be the set of minimiz- 
es for the variational problem (fT5|) and F* t be its image in W. Then F* t is 

a non-empty compact subset of W. Moreover, for each e > there exists a 
positive constant C(e,p,t) depending only on e, p and t such that for any n, 

F(S a (G(n,p),F; jt ) > e | T n , p > tn 3 ) < e - c ^>\ 

Proof. Let F := {/ G W : T(f) > t}. By Theorem 3.7 in [6j, F is a closed 
subset of W. Therefore by Theorem 12.31 

lim sup ^ log P(T„ >P > tn 3 ) = lim sup log W(G(n,p) G F) 

n— >oo Tl n^oo Tl 

< — inf I p (h) = —(j>(p,t). 

heF 

Next, let U := {f G W : T(f) > t}. Again by Theorem 3.7 of [6], U is an 
open set. Therefore by Theorem 12.31 f° r each e > 0, 

\ \ 

lim inf ^logP(T„ p > tn 3 ) > lim inf \og¥(G{n,p) G U) 

> - inf UK) > -<f>{p,t + e). 
heu 

In Proposition 14.21 below, it is proved that is a continuous function of 
t for every fixed p. This completes the proof of the first assertion of the 
theorem. The second assertion is merely a corollary of Theorem 13.11 The 
condition (|12j) required for Theorem 13.11 can be easily shown to follow from 
the continuity of 4> m t, because any / with T(f) > t lies in the interior of 
the set {h : T(h) >t}. □ 

4.3. Properties of the rate function. Given Theorem 12.3^ there is a 
natural desire to understand the rate function eft. The following proposition 
summarizes some basic properties of 4>. The first property is required in the 
proof of Theorem 14.11 above. 

Proposition 4.2. For each fixed p G (0, 1), the following hold: 

(i) The function (j){p,t) is continuous in t in the interval [0, 1/6). 
(u) As a function of t, (j)(p,t) = in the interval [0,p 3 /G] and strictly 
increasing in (p 3 /6, 1/6). Moreover, forp 3 /6 <t < s < 1/6, 

4>(p,t) < (t/s^^s). 

(Hi) Fort G (p 3 /6,l/6), 4>{p,t) can be alternately represented as 

<P(p,t) := inf {/„(/) : / G W, T(f)=t}. 

Moreover, if {f n }n>i is a sequence in W such that T(f n ) > t for all n 
and Ip(fn) — > <t>{p,t), then T(f n ) — > t. In particular, the elements of 
F* t all satisfy T(f) = t. 
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Proof. Let us start by proving that <fi is continuous in t. For each / € W 
and 5 € [0,1], let 

:=/ + 5(i_/). 

By the inequality 

(a + 5(1 - a))(b + 5(1 - 6))(c + 5(1 - c)) 
= a6c + 5((1 - a)6c + (1 - b)ac + (1 - c)a6) 

+ 5 2 ((1 - o)(l - 6)c + (1 - a)(l - c)6 + (1 - 6)(1 - c)a) 

+ 5 3 (l-a)(l-6)(l-c) 
> a&c + 5 3 ((1 - a)6c + (1 - b)ac + (1 - c)a6) 

+ 5 3 ((1 - o)(l - b)c + (1 - o)(l - c)b + (1 - 6)(1 - c)o) 

+ 5 3 (l-a)(l-6)(l-c) 
= abc + 5 3 (1 - abc), 
we see that 

(16) T (f*)>T(f)(l-6 3 ) + ?-. 

o 

Take any i G [0, 1/6) and any / such that T(f) > t. Suppose t n 1 i. Let 5 n 
be the smallest number in [0,1] such that T(/ <5n ) > t n . By (|16|) it follows 
that S n exists and limn^oo 5 n = 0. Therefore by the dominated convergence 
theorem, ]ha n -+ oa I p (f Sn ) = I p {f). Thus, 

lim <j>{p,t n ) < lim 7 p (/ tf ") = /„(/). 

n— >-oo n— >oo 

Since this is true for every / such that T(f) > t and is a non-decreasing 
function in t, this proves the right continuity of <f>. 

Next, take a sequence t n "f" i. Let / n be a sequence of functions such that 
T(/n) > *n and 

lim Ip(f n ) = lim (p(p,t n ). 

n— >oo n— >oo 

For each n, let 8 n be the smallest number in (0,1) such that T(/^ n ) > t. 
By ([15]) . 5 n exists and limn^oo 5 n = 0. Now, the function I p on [0, 1] (defined 
in (j6j) is uniformly continuous on [0, 1]. As a consequence, 

lim sup \I p {f) - I p (f)\ =0. 



lim = lim <£(p,t n ) 



In particular, 

n— ¥oo n— >oo 

But 4>(p,t) < I p (fn n ) for each n. By the monotonicity of (f), this proves left 
continuity. 

Next, note that since P(T(G n ,p) > tn 3 ) 1 for any i < p 3 /6, TheoremO 
and the continuity of <f> imply that <f>(p, t) = for t < p 3 /6. Let us now show 
that 4>{p,t) is strictly increasing in £ when t £ (p 3 /6, 1/6). 
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Fix p 3 /6 < i < s < 1/6. Fix e > 0. Take any / € W such that T(f) > s. 
For each 6 G (0, 1), let 

(17) f s (x,y):=(l-5)f(x,y) + 5p. 
By the inequality 

((1 - 6)a + 5p)((l - 5)b + 5p)((l - S)c + Sp) > (1 - Sfabc 

we have 

T(f s )>(l-5) 3 T(f)>(l-S) 3 s. 

Thus, if we take 5 such that (1 — <5) 3 = t/s, then T(f$) > t. By the convexity 
of the function I p on [0, 1] defined in ([6]), we see that for any x > p, 

J p ((l - <5)x + 5p) < (1 - 5)/ p (x) + J/ p (p) = (1 - <5)/ p (x), 

and therefore, 

(18) I p (fs)<(l-5)I p (f). 

Thus, <f>(p,t) < Ip(fs) < (*/ s ) 1 ^ 3 ^p(/)- Since this holds for any / with 
T(f) > s, it shows that 

(19) <t>(jp,t)<{t/s)y*<i>(p,s). 

To show (ft is a strictly increasing function of £ in the interval (p 3 /6, 1/6), it 
therefore suffices to prove that <fi(p, t) > for t in this interval. This follows 
easily, since the strict convexity of I p on [0, 1] and equation (TTHl) show that 
equality in (I19D can hold only if / = p almost everywhere for some / such 
that T(f) > s, which is impossible since s > p 3 /6. 

Next, fix t € (p 3 /6, 1/6) and take any sequence {f^} in W such that 
T(f^) > t for all n and I p (f^) -4 <fi(p,t). Recall the subscript notation 
introduced in (1171) above. By the continuous mapping theorem, there exist 
S n € [0, 1] such that for each n, 

n& = *■ 

Therefore by (fTHD . 

which proves that <5 n — >■ 0. This proves that 

<t>(p,t)=mf{I p (f):f£W, T(f)=t}. 

Since 5 n — > 0, this also proves that T(f^) —> t. This completes the proof 
of Proposition S2J □ 
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4.4. The 'Replica Symmetric' phase. Note that there are two "extreme" 
functions that satisfy T(f) = t. First, there is the constant function 

(20) ct(x,y) = (et) 1 / 3 . 

On the opposite extreme, there is the function xt, defined as 

1 if max{x,y} < (Qt) 1 / 3 , 
otherwise. 



(21) Xt(x,y)r- 



In a limiting sense, c± represents an Erdos-Renyi random graph with edge 
probability (6i) 1;/3 , while xt represents the union of a clique of size n(6t) 1/ ' 3 
and a set of isolated vertices of size n(l — (6i) 1//3 ). 

It is simple to see that ()14|) holds if and only if the infimum in ()15f) is 
attained at the constant function q . The following theorem gives a sufficient 
condition for this to happen. This extends the main result of [9]. The meth- 
ods of [9] are closely related to methods from statistical physics; drawing 
inspiration from this connection, one may call the region where q solves the 
variational problem (|15p as the 'replica symmetric phase' of the problem. 



Theorem 4.3. For each < p < 1 and t G (p 3 /6, 1/6), let h p (t) := 
Ip^Gt) 1 ' 3 ) . For t € [0,p 3 /6], let h p (t) = 0. Let h p be the convex mino- 
rant of h p (i.e. the maximum convex function lying below h p ). If t is a 
point in (j> 3 /6, 1/6) where h p {t) = h p (t), then the variational problem (|15p 
for this pair (p,t) is uniquely solved by the constant function ct = (6t) 1 / 3 . 
Consequently, 4>(p,t) = h p (t). Moreover, for such (p,t), for each e > 

lim ¥(8 D (G(n,p),c t ) > e | T n , p > tn 3 ) = 0. 

n— too 

Since ct is the limit of G(n, (6t) 1 / 3 ), this means that for such (p,t) the con- 
ditional distribution of G(n,p) given T n>p > tn 3 is indistinguishable from the 
law of G(n, (Gt) 1 ^ 3 ) in the large n limit. 

Proof. Since h p is an increasing function, it is easy to see that h p (t) = h p (t) 
for t < p 3 /6. (Incidentally, this also shows that h p is an increasing function 
in [0, 1/6), and strictly increasing in (p 3 /6, 1/6).) 

Suppose t is a point in (p 3 /6, 1/6) such that h p (t) = h p (t). We claim that 
there exists f3 > such that 

(22) t = argmax^Q 1/6] (/3x - h p (x)). 

To see this, observe that since h p is convex and strictly increasing in the 
interval (p 3 /Q, 1/6), there exists /? > and c£l such that the line y = f3x+c 
lies below the curve y = h p (x) and touches it at x = t. But we also know 
that h p lies above h p and the two curves touch at t. Thus, 

t = argmax :r (/3x + c — h p (x)) = argmax :r (/3x — h p (x)). 

This proves the claim. 
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Now take any / € W such that T{f) > i. Let q be the function that 
is identically equal to ^i) 1 / 3 . Then T(cf) = t, and by Holder's inequality 
and (1221), 



Pt - I p (c t ) = JI (fatM? - I p ( Ct (x,y)) ) dx dy 
>ff - Ip(/(x, y))) dx dy 

- f /// /( x > x ) dx dy dz- II I p (f(x, y)) dx dy 



6 . 

> # - /„(/)■ 

Thus, q minimizes ip(/) among all / such that T(f) > t. This shows that 

<t>{p,t) = Ip(<k) = h p (t). 

The uniqueness of the optimizer follows from the Holder step in the above 
deduction. Finally, the claim about the conditional distribution follows from 
Theorem 13,11 and the uniqueness of the minimizer. □ 

It is easy to show that for any p > 0, h p (t) = h p (t) for all t € (p 3 /6, t') U 
(t", 1/6) where t' and t" depend on p. Similarly, given any t € (0,1/6), 
there exists p' < (6t) 1//3 depending on t such that for all p € (p f , (6t) 1 / 3 ), 
h p (t) = h p (t). Thus, there is a nontrivial set of (p,t) where cj solves the 
variational problem and consequently 4>(p,t) = h p (t). As mentioned before, 
this recovers the main result of [9]. The conclusion about the conditional 
distribution is a new result. 

4.5. Replica Symmetry Breaking. Given Theorem 14.31 it is quite in- 
teresting to note that the variational problem (115j) is not solved by con- 
stant functions everywhere. From the physical point of view espoused in [9j, 
however, this is not surprising; it is simply the effect of replica symmetry 
breaking down in the 'low temperature regime'. 

The phase transition is very easy to establish using Theorem 14. 1\ by 
comparing the performances of q and xt defined in (|20p and (|2ip . A simple 
computation shows that for any t € (0, 1/6), 

(23) limT i^L = M^>M^ = Um U*> 



p->olog(l/p) 2 2 p->o log(l/p)' 

Combining the above observation with Theorem 13.11 it follows easily that 
there are values of (p,t) such that given T niP > tn 3 , the graph G(n,p) must 
look different than an Erdos-Renyi graph. Again, compactness is crucial. 
This is formalized by the following theorem. 

Theorem 4.4. Let C denote the set of constant functions in W. For each 
t £ (0,1/6), there exists p' > such that for all p < p' , the variational 
problem (fT5j) is not solved by the constant function ct = (6t) 1//3 . Moreover, 
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for such (p,t), 5a(F*j,C) > 0, where F* t is the set of minimizers defined 
in Theorem \A.l\ Consequently, there exists e > such that 

lim ¥(8 n (G(n,p),C) >e\T np > tn 3 ) = 1. 

n— >oo 

Proof. By f)23[) we see that for each t, there exists p' > such that ct is 
not a minimizer for the problem (I15h if p < p'. Take any such (p,t). By 
Theorem l4.H is non-empty and compact. By part (Hi) of Proposition l4~2l 
and the non-optimality of q, it follows that C and F* t must be disjoint. But 

C and F* t are both compact subsets of W. Therefore, 6a(C,F*^) > 0. The 
last claim follows by Theorem 14.11 □ 

4.6. The double phase transition. A combination of Theorem 14.41 and 
Theorem 14.31 shows that for all small enough p, the variational problem (|15p 
has a 'double phase transition'. (Actually, there may be more than two 
phase transitions, but we show that there is at least two.) 

Indeed, for all small enough p, Theorem 14.31 (or the results of [9]) show 
that there exists p 3 /Q < t' < t" < 1/6 such that 4>(p,t) = 7 p ((6t) 1 / 3 ) for 
all t € (p 3 /6,t') U (t",l/6). On the other hand by Theorem [OJ it follows 
that for all small enough p, the variational problem (|15p is not solved by a 
constant function at the point (p, 1/2). Combining these two observations 
gives the following theorem. 

Theorem 4.5. There exists po > such that if p < po, then there exists 
p 3 /6 < t' < t" < 1/6 such that the variational problem (\15h is solved by the 
constant function Ct = (6£) 1//3 when t € (p 3 /6,t') U (i",l/6) ; but there is a 
non-empty subset of (t', t") where all optimizers are non-constant. 

Of course, as shown by Theorems 14.31 and 14.41 the significance of optimiz- 
ers being constant or non-constant is in whether the conditional behavior of 
G(n,p) given T n ^ p > tn 3 is close to that of an Erdds-Renyi graph or not. 

4.7. The small p limit. The last theorem of this paper describes the nature 
of 4>(p, t) and F* t when t is fixed and p is very small, tending to zero. The 
essence of the result, perhaps not surprisingly, is that when t is fixed and 
p — > 0, then conditionally on the event {T njP > tn 3 } the graph G(n,p) must 
look like a clique. 

Theorem 4.6. For each t G (0, 1/6), 

]im ^) _ 050* 

P ^o log(l/p) 2 

Moreover, if \t is the function defined in (|2ip and F* t is defined as in 

Theorem 14.1). then the set F* t converges to the point xt as p ^ 0, in the 
sense that 

lim sup S a (f,xt) = 0. 
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Consequently, for each e > 0, 

lim lim F(5 D (G(n,p),xt) > e | T n , p > tn 3 ) = 0. 

jj->0 n->oo 

Proof. In this proof, C will denote any constant that does not depend on 
anything else. The value of C may change from line to line. All integrals 
will be over the interval [0, 1]. 

Fix t G (0, 1/6). For each p < ^i) 1 / 3 , choose a function f p € F* t . From 
the definition of I p , observe that if p < 1/2, 



Wp) ~ o / / /p( x > lo s - dx d V 



(24) 



On the other hand, by the definition of F* t , 



P 



< C. 



I p (f P )<I P (Xt)<^^log- + C. 

z p 



Combining the last two inequalities gives 



(25) 

Next, let 



log(l/p)" 



J J f p (x,y)dxdy<(6t) 2 / 3 + 

h p (x,y) - J f p (x,z)f p (z,y)dz. 



Then by two applications of the Cauchy-Schwarz inequality, the inequal- 
ity ([25]) . and the fact that f p (x,y) € [0, 1], we get the following important 
sequence of inequalities. 



(26) 



(6t) 2 = (6T(/ P )) S 



h P (x,y)f p (x,y) dx dy 



< 



< 



h p (x,y)dxdy // f p (x,y)dxdy 



f p (x,z)dz / f p (y,z)dz) dxdy II f p (x,y)dxdy 



f P {x,y)dxdy 



< 



f p (x,y)dxdy) < [ (6t) 2 / 3 + 



C 



11 ' 1 log(l/p) 

A direct consequence of (|26p . combined with (|24p. is that 



lim 



4>{p,t) 



lim W,) 



lim 



1 



f p (x,y)dxdy 



(6t) 2 / 3 



p^o log ( 1 jp) p-+o log(l/p) p-5-o 2 

which proves the first assertion of the theorem. A second important conse- 
quence of (f2"6"j) . to be useful later, is that 

(27) If f P (x,y)(l - f p (x,y))dxdy < j^^. 
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Next, note that for any x, y 
1 
2 



if P (x, z)f p (z', y) - f p (x, z')f p (z, y)) dz dz' 

fp(x,z)fp(z',y)dz dz' - h 2 p (x,y) 
f 2 (x,z)dz / f 2 (y,z)dz - hl(x,y). 



It follows from this and (I26H that 



< 



(fp{x,z)f p (z',y) - f p (x,z')f p (z,y)) 2 dzdz' dxdy \\ f 2 (x,y)dxdy 
C 



(28) 




log(l/p) 

The above inequality and the lower bound on ff f p (x, y)dxdy from (|26p give 
(f P (x, z)f p (z', y) - f p (x, z')f p (z, y)) 2 dz dz' dx dy 

<c . 

- t 2 / 3 log(l/p) 

Let M p := JJ f p (x,y)dxdy. For each x, let m p (x) := M p J f p (x,y)dy. 
An application of Jensen's inequality to ([28]) gives 



// 



C 

( fp(x, z)M p - Mpmp(x)m p (z)) 2 dz dx < f2 /3 l og (l/p) ' 



By (US}, M p > (6t) 2 / 3 . Thus, m p is bounded by {Gt)' 1 / 3 , and 

C 



(29) J J (f P (x, z) - m p (x)m p (z)) dz dx < ^ ^ . 

For each p, let n p : [0, 1] —> [0, (6i) -1 / 3 ] be a step function (i.e. a function 
that is constant on intervals) such that 

(30) /KM-^x))^,, 

(Such functions exist because we can approximate m p by a continuous func- 
tion to any degree of accuracy by Lusin's Theorem [26], and then approxi- 
mate the continuous function by a step function.) 

Let a p be a measure preserving bijection of [0, 1] such that n p (a p x) is 
a non-increasing function. Such a bijection is easy to construct because 
n p is a step function. Let £ p (x) := n p (a p x) and g p (x,y) := f p (a p x , a p y) . 
By the monotonicity and uniform boundedness of £ p there exists a sequence 
{Pi}i>i decreasing to zero such that £ p . converges in L 2 to a limit function I. 
Therefore by (f29|) and (f3Q|) , g Pi — > g in L 2 , where 

g(x,y) := £{x)i{y). 
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By this and (j27[) . g is a {0, l}-valued function. It is not difficult to see 
from this and the non-negativity of I that I must also be {0, l}-valued. (If 
£(x) {0, 1} on some set of positive measure, then there may be a set A 
of positive measure where £(x) G (0, 1), or there may be a set A of positive 
measure where £(x) £ (l,oo). In either case, g(x,y) {0,1} on A x A.) 
Since i is monotone decreasing, it follows that i must be the indicator of 
an interval of the form [0,6] for some b € [0,1]. Lastly, (126p implies that 
JJ g(x,y)dxdy = (6i) 2//3 , and therefore b = (6t) 1//3 . Consequently, g = xt- 

The above argument establishes that for any collection {/ p } p >o such that 
f p € F* t for each p, there is a sequence {pi}i>\ decreasing to zero such that 
f p . — > xt in the cut metric. The same argument can be extended to show 
that for any sequence {f Pi } such that pi — > and f Pi € F*.^ for each i, there 
is a subsequence converging to xt in the cut metric. This proves the second 
assertion of the theorem. The last claim of the theorem follows from this 
and Theorem 14.11 □ 

4.8. Open questions. There are many questions that remain unresolved, 
even in the simple example of upper tails for triangle counts that has been 
analyzed in this section. For instance, what is the set of optimal solutions 
of the variational problem (|15p in the broken replica symmetry phase (i.e. 
where the optimizer is not a constant)? Is the solution unique in the quotient 
space W, or can there exist multiple solutions? Is it possible to explicitly 
compute a nontrivial solution of (I15p for at least some value of (p,t)7 Is 
it possible to even numerically evaluate or approximate a solution using a 
computer? Does Theorem 14.31 characterize the full replica symmetric phase? 
If not, what is the exact phase transition boundary? What happens in the 
sparse case where p and t are both allowed to tend to zero? At the time of 
writing this paper, we do not know how to answer any of these questions. 
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